EmoChildRu: Emotional Child Russian Speech Corpus
Authors
Abstract
We present the first child emotional speech corpus in Russian, called “EmoChildRu”, which contains audio recordings of children aged 3-7 years. The database includes over 20K recordings (approx. 30 hours) collected from 100 children. Recordings were made in three controlled settings designed to elicit different emotional states: playing with a standard set of toys; repeating words after a toy parrot in a game store setting; and watching a cartoon and retelling the story. The corpus is designed for studying how emotional state is reflected in voice and speech characteristics, and for studying the formation of emotional states in ontogenesis. A portion of the corpus is annotated for three emotional states (discomfort, neutral, comfort). Additional data include brain activity measurements (original EEG and evoked potential records), results of adult listeners' analysis of child speech, questionnaires, and descriptions of dialogues. The paper reports two child emotional speech analysis experiments on the corpus: one with adult listeners (humans) and one with an automatic classifier (machine). Automatic classification results are very similar to human perception, although accuracy is below 55% for both, which shows the difficulty of recognizing child emotion from speech under naturalistic conditions.
Related resources
Emotion, age, and gender classification in children's speech by humans and machines
In this article, we present the first child emotional speech corpus in Russian, called “EmoChildRu”, collected from 3-7 year old children. The base corpus includes over 20K recordings (approx. 30 hours), collected from 120 children. Audio recordings are carried out in three controlled settings by creating different emotional states for children: playing with a standard set of toys; repetition ...
Russian infants and children's sounds and speech corpuses for language acquisition studies
«INFANTRU» and «CHILDRU» are the first Russian child speech databases. The «INFANTRU» corpus contains longitudinal vocalization and speech recordings (n=2967) of 99 children aged 3 to 36 months, as long utterance sequences and separate utterances in different psychoemotional states of the child. The database “CHILDRU” contains the records (n=28079, 13956Mb) of 150 children’s speech at the age fro...
Cross-language perception of emotional children's speech in German and Russian
The paper concerns universal and language-specific aspects of emotion perception in children's speech. Three experiments were carried out to investigate differences and similarities in the assessment of emotions by German and Russian adult listeners. The corpora of German and Russian emotional children's speech were employed in the first and second experiments. In the third experiment German an...
CoRuSS - a New Prosodically Annotated Corpus of Russian Spontaneous Speech
This paper describes speech data recording, processing and annotation of a new speech corpus CoRuSS (Corpus of Russian Spontaneous Speech), which is based on connected communicative speech recorded from 60 native Russian male and female speakers of different age groups (from 16 to 77). Some Russian speech corpora available at the moment contain plain orthographic texts and provide some kind of ...
Russian Spontaneous Speech Rate (Based on the Speech Corpus of Russian Everyday Interaction)
The paper presents the results of the analysis of Russian spontaneous speech rate made on the basis of recordings of 40 speakers and their communicants. The impact of gender, age and place of residence on the speech rate is illustrated by the findings.